# RAM Control Block Documentation

Michael Li — ml1811@ic.ac.uk

## 1 FSM Design



The purpose of this block is to accept the various commands from upstream and then write the changes into vram. It is however more efficient to write the entire word into vram in one go, so we will cache all the changes made to the same word and only when we are given a command to draw in a different word do we write out the changes into vram. This block uses the existing px word cache and ram fsm. Ram fsm will need to be modified to take an additional input (from the cache) as well as some combinatorial logic to then merge the changes saved in the cache, with what is in vram. This writing out to ram takes 3 cycles to do and we need to stall the controlling fsm till this is complete.

FSM fires control signals denoted \_trig, any flags are denoted \_ and are registered such that they persist across state changes. Processes corresponding to the trigger signals are all conditionally executed based on the respective trigger signal being asserted by enclosing the entire body in a giant IF.

# 2 FSM Operation

FSM has the following states:

s\_error: trap state to help debugging. Can only leave with a reset command

s\_idle : reset initialises FSM to this state.

IF idleCounter = N,  $goto s_flush$ 

Otherwise stay in same state untill startcmd.

s\_checkrange : checks to see if target pixel is in current loaded word
if out of range goto s\_flush

otherwise decode and goto either draw or clear. This uses a pair of functions defined in helper\_funcs that convert x y coordinates into ram word adress and ram bit number

s\_draw: check for move\_trig or draw\_trig and signals the px\_draw process to start

An idea would be as draw will always complete in one cycle, that we can have more logic that reads in a new sample while we are in draw, as we know it will finish in one cycle, by the time we could fetch and decode the next instruction, some kinda of pipelining?

s\_clear :to be implemented, currently goes to idle

s\_flush:triggers flush process with flush\_trig otherwise goes to waitram state s\_fetch\_draw: triggers the draw\_px process with control signal fetchdraw.

s\_waitram :Waiting for ramfsm to finish the rwm operation. The next transition is based on which state it came from if vram done writing

A flag is stored in the registered fetchdraw flag process which is high if it came from rangecheck, instead of idle. The FSM transitions to to fetchdraw if the flag is high, otherwise it goes to idle.

RESET will force the FSM back to the idle state on the next clock edge, default state transition is to error to help debugging. All intentional transitions should be explicitly specified.

Inputs:state, dbb\_bus.startcmd, curr\_vram\_word, draw\_trig,vram\_done
fetch\_draw\_flag, idle\_counter

Driven outputs: idle\_counter\_trig, draw\_trig, move\_trig, flush\_trig, fetch\_draw\_trig

#### 3 Processes

### 3.1 state\_transition (comb)

Combinatorial transition matrix for fsm. Computes the next state to transition to and asserts control signals to fire processes. When not in this state, dbb\_delaycmd is high.

**Driven outputs:** next\_state, idle\_counter\_trig, draw\_trig, move\_trig, flush\_trig, fetch\_draw\_trig, dbb\_delaycmd

## 3.2 fsm\_clocked\_process (clocked)

Clocked part of fsm for transitions and registered flags.

Driven outputs: rcb\_finish, fetch\_draw\_flag

#### 3.3 idle\_counter (clocked)

Handles the counter to store the number of idle cycles elapsed.

Driven outputs:idle\_counter.

### 3.4 current\_word\_register (clocked)

Stores the current word address loaded into cache, so we can check if the new command is in this range.

**Driven outputs:** curr\_vram\_word.

## 3.5 draw\_px (comb)

Does the draw single pixel if draw\_trig is high, if in range. Sets the address for pixel cache and enables single write by setting pw =1, wenall=0 and pixopin to the correct value based on the draw command. If fetch\_draw trigger is high then does draw and clears out the cache. The duplicated draw colour decoding could be optimised perhaps with a full decode stage that will store all the relevant flags and commands in a set of registers could be more efficient or at least prevent code duplication?

**Driven outputs:** pxcache\_pxopin, pxcache\_pixnum, pxcache\_wen\_all, pxcache\_pw.

#### 3.6 flush\_cache (comb)

Writes out the changes in the pixel cache to vram. This process triggers the ram\_fsm entity to begin the interfacing. I will need to modify ram\_fsm to take an addition input, connecting the output of px\_cache to this new input of ram\_fsm. Add a new combinatorial process that will apply the changes from cache into what was read from memory by combining vdout with store\_ram. If the value in cache is a explicit colour, I write that to vram, otherwise I either invert or leave the same. If the if\_same signal is asserted, then I have no need to write to vram as there have been no changes made in the cache.

**Driven outputs:** ram\_addr, ram\_start.

#### 4 Functions

### 4.1 getRamWord

This function takes in an x and y coordinate from dbb\_bus and computes the corresponding word address containing that pixel specified returning it as an 8 bit vector. This value is then used to check if it will be a "cache hit" and also passed to the ramfsm.

#### 4.2 getRamBit

This function takes in an x and y coordinate from dbb\_bus and computes the corresponding bit number in the word that is then used to address the pixel cache.

#### 5 Other notes

rcb\_finish asserted inside clocked fsm proc

There is scope for potential parallelisation of commands. Seeing as it takes 3 clock cycles to complete a standard draw due to the state transitions, we could pipeline the instruction decode in some way to take advantage of that as long as we're not in the cache flush stage.